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SUMMARY 


Background 

Running  economy  (RE)  is  an  efficiency  index.  Greater  economy  means  that  a  person  uses  less 
energy  to  run  a  given  distance.  Given  two  equally  fit  runners,  the  more  economical  will  run  a 
given  distance  faster  than  the  less  efficient  runner.  For  two  runners  with  different  levels  of 
fitness,  a  less  fit  runner  may  be  able  to  use  greater  economy  to  outperfonn  a  fitter,  less 
economical  runner.  Accurate  measurement  of  RE  is  necessary  to  quantify  its  effects  precisely. 

Objective 

Standard  RE  measures  are  based  on  the  assumption  that  oxygen  consumption  rate  is  a  linear 
function  of  running  speed.  The  slope  of  the  line  (bi),  which  is  the  amount  of  oxygen  needed  to 
run  1  unit  of  distance,  defines  RE.  If  the  intercept  of  the  function  (bo)  is  zero,  the  oxygen  uptake 
rate  measured  at  any  running  speed  can  be  divided  by  speed  to  obtain  an  unbiased  estimate  of  bj. 
Nevill,  Cooke,  Holder,  Ramsbottom,  and  Williams  (1992)  presented  evidence  that  bo  ^  0. 
Instead,  b0  varies  from  person  to  person.  The  objective  of  this  study  was  to  further  evaluate  the 
bias  in  RE  measures  implied  by  this  finding. 

Methods 

Data  came  from  5  published  studies  that  reported  steady-state  oxygen  uptake  for  individual 
participants  who  had  completed  running  bouts  at  3  or  4  different  submaximal  speeds.  The 
measured  oxygen  uptake  (mlCk)  was  modeled  as  a  linear  function  of  running  speed  (s)  (i.e., 
ml02=  bo  +  bi*s).  Analysis  of  covariance  (ANCOVA)  and  hierarchical  linear  modeling  (HLM) 
evaluated  both  the  intercepts  (bo)  and  slopes  (bi)  for  this  model. 

Results 

Two  mathematical  models  provided  reasonable  summaries  of  the  data.  One  model,  which 
combined  individual  differences  in  b0  with  a  fixed  bi,  replicated  the  Nevill  et  al.  (1992)  findings. 
The  other  model,  which  combined  a  fixed  bo  with  individual  differences  in  bi,  indicated  that  a 
simple  modification  would  eliminate  bias  from  standard  RE  measurement  methods.  The 
modification  would  subtract  3.18  mlCfykg  •min'1  from  measured  oxygen  uptake.  The  second 
model  was  only  slightly  better  in  explaining  the  variation  in  oxygen  consumption,  but  it  was 
supported  by  HLM  findings  indicating  that  bi  varied  significantly  across  individuals  while  bo  did 
not.  Even  these  modest  differences  were  infonnative  in  light  of  the  very  strong  correlation  (r  = 
.935)  between  individual  differences  in  bo  and  individual  differences  in  bi. 

Conclusions 

Logical  considerations  also  favor  the  second  model.  For  example,  the  model  with  variable  bo 
would  require  an  explanation  for  the  fact  that  some  individuals  had  negative  resting  oxygen 
uptake  rates,  a  physiological  impossibility.  Combining  logic  with  the  empirical  evidence,  the 
conclusion  is  that  a  slight  modification  the  standard  approach  yields  unbiased  RE  estimates.  The 
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RE  modification  combines  individual  differences  in  bi  with  a  fixed  bo  value.  The  combination 
yields  an  adjusted  RE  based  on  net  oxygen  uptake.  The  adjustment  based  on  the  present  evidence 
would  be  3.18  mlC^-kg  -min'1,  so  adjusted  RE  =  (mlCE  -  3.18)/s.  Other  research  suggests  that  the 
proper  adjustment  may  be  in  the  range  from  3.6  to  5.0  mlOo-kg^-min"1.  Further  study  to  better 
define  the  required  adjustment  could  be  useful.  A  review  of  existing  research  found  adjusted  RE 
measures  in  about  1  of  every  12  studies.  The  present  findings  suggest  that  this  practice  should  be 
the  standard  approach  instead  of  a  rarity. 


Introduction 


The  oxygen  cost  of  running  (OCR)  is  the  oxygen  consumed  in  running  a  given  distance. 
This  cost,  which  is  typically  expressed  as  ml02'kg'1km'1  or  mlOrkg-m  ,  is  assumed  to  be  a 
fixed  value  determined  by  the  distance  covered.  OCR  is  assumed  to  be  independent  of  the  speed 
at  which  the  distance  is  covered.  These  assumptions  are  based  on  a  linear  relationship  between 
the  rate  of  oxygen  uptake  and  running  speed  for  speeds  above  8  km-hr"1  (McArdle,  Katch,  & 
Katch,  2001). 

Individual  differences  in  OCR  are  a  common  research  topic.  Running  economy  (RE)  is  the 
label  given  to  these  differences.  An  individual  with  a  lower  value  for  OCR  (i.e.,  greater 
economy)  will  expend  less  cumulative  energy  to  run  a  given  distance  than  will  an  individual  with 
higher  OCR.  The  RE  label  highlights  this  difference  in  efficiency  and  directs  attention  to  an 
important  attribute.  RE  can  explain  why  performance  is  imperfectly  linked  to  maximal  oxygen 
uptake  capacity.  Given  two  runners  with  equal  maximal  capacity,  the  more  economical  of  the 
two  will  be  able  to  convert  this  capacity  into  faster  run  times.  Given  two  runners  with  different 
maximal  capacities,  the  one  with  less  capacity  may  perform  better  than  his  or  her  counterpart  if 
he  or  she  is  more  economical. 

The  belief  that  OCR  is  independent  of  running  speed  simplifies  the  measurement  of  RE.  A 
person  can  run  at  a  submaximal  speed  long  enough  to  reach  steady-state  oxygen  uptake.  The  RE 
estimate  is  the  steady-state  uptake  divided  by  the  running  speed.  If  OCR  is  independent  of  speed, 
this  ratio  will  be  the  same  no  matter  what  running  speed  is  chosen  for  the  test.  The  only 
requirement  is  that  the  speed  should  be  >8  km-hr'1,  the  approximate  speed  at  which  the  preferred 
mode  of  locomotion  changes  from  walking  to  running  (McArdle  et  ah,  2001). 

The  belief  that  OCR  is  independent  of  running  speed  is  not  likely  to  be  literally  correct. 

The  general  equation  for  a  linear  relationship  is  y  =  bo  +  bix.  Substituting  111IO2  as  the  dependent 
variable  and  speed  (s)  as  the  predictor,  this  general  equation  becomes  ml02  =  b0  +  b(s.  OCR  is 
independent  of  speed  only  if  bo  =  0.  In  this  special  case,  ml02  =  bi*s.  Because  OCR  =  ml02/s, 
the  special  case  of  the  general  equation  can  be  rewritten  to  yield  OCR  =  ml02/s  =  bi.  The  last 
tenn  in  this  sequence  of  equalities  does  not  include  speed,  so  OCR  has  the  same  value  (i.e.,  bi) 
regardless  of  speed.  Thus,  OCR  is  independent  of  speed  only  in  the  special  case  of  bo  =  0. 
Empirical  instances  of  the  special  case  would  be  an  exception  to  the  more  common  finding  that 
b0  7^  0  (c.f,  Leger  &  Mercier,  1984;  Nevill  et  ah,  1992). 

The  bias  in  standard  RE  measures  when  bo  7^  0  is  readily  evident.  In  this  case,  ml02/s  =  bi 
+  bo/s.  The  second  term  on  the  right  hand  side  of  the  equation,  bo/s,  is  the  bias.  Note  that  bias 
does  not  necessarily  affect  the  assessment  of  RE.  If  bo  =  k  for  all  individuals,  bias  is  the  same  for 
all  individuals.  In  this  case,  bias  will  affect  the  average  cost,  i.e.,  OCR,  but  the  variation  of 
individual  values  about  that  average  will  not  be  affected.  Thus,  standard  OCR  measurements 
translate  into  proper  RE  estimates  if  bo  ~  k  for  all  individuals. 

Nevill  et  al.  (1992)  presented  evidence  that  bo  differs  substantially  between  individuals.  In 
their  study,  each  participant  perfonned  3  submaximal  runs  at  different  speeds.  Steady-state  111IO2 
was  measured  at  each  speed.  The  data  were  analyzed  by  ANCOVA.  Oxygen  uptake  rate  was  the 
dependent  variable.  Participant  was  the  group  variable.  Speed  was  the  covariate.  The  pooled 
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regression  of  ml02  on  speed  accounted  for  79%  of  the  variance.  Individual  differences  in  bo 
accounted  for  18%  of  the  VO2  variance. 

Nevill  et  al.’s  (1992)  findings  raise  doubts  about  the  status  of  the  RE  as  a  theoretical 
construct.  In  their  analysis,  RE  would  be  evident  as  individual  differences  in  bi.  These 
differences  accounted  for  only  0.4%  of  the  variance  in  mlCfi.  Although  statistically  significant  (p 
<  .023),  considerations  of  parsimony  would  exclude  RE  from  the  resulting  model.  Parsimony  can 
be  defined  as  the  number  of  parameters  in  a  model  (Popper,  1959).  In  this  case,  including  the 
individual  differences  in  bi  that  define  RE  would  approximately  double  the  number  of 
parameters  to  increase  explanatory  power  by  less  than  1%.  This  trade-off  in  complexity  versus 
explanatory  power  is  not  sufficient  to  justify  including  RE  in  the  final  model.1 

The  model  implicit  in  standard  RE  measurement  practices  differs  markedly  from  the  model 
implied  by  Nevill  et  al.’s  (1992)  findings.  The  difference  can  be  described  by  considering  the 
pattern  of  lines  that  would  be  expected  if  the  ml02  were  plotted  as  a  function  of  speed  for  each 
individual  in  a  sample.  The  standard  method  assumes  that  individual  differences  would  be 
expressed  as  differences  in  the  bi  values  for  the  lines.  If  all  of  the  lines  converged  to  zero  when  s 
=  0,  these  differences  in  bi  would  yield  a  fan-shaped  array  of  lines.  This  possible  outcome  is 
referred  to  as  the  fan  model  in  the  remainder  of  this  paper. 

Nevill  et  al.’s  (1992)  findings  imply  a  very  different  picture.  The  variance  explained  by 
differences  in  bi  is  so  slight  that  slope  can  be  regarded  as  a  constant  for  all  participants.  The 
variance  explained  by  differences  in  bo  is  relatively  large,  so  a  plot  of  the  data  for  each  individual 
would  show  a  set  of  parallel  lines.  The  lines  would  be  parallel  because  they  would  all  share  the 
common  value  for  bi,  but  differences  in  bo  would  separate  the  lines  for  different  subjects.  The 
differences  in  bo  might  have  a  physiological  interpretation.  These  differences  might,  for  example, 
be  equated  with  differences  in  resting  metabolic  rate  or  resting  energy  expenditure.  These 
suggestions  are  only  illustrative,  because  bo  differences  really  refer  to  estimated  ml02  associated 
with  a  hypothetical  behavior  (i.e.,  running  at  s  =  0).  This  behavior,  if  possible  at  all,  might  not 
equate  with  any  current  physiological  construct.  Still,  Nevill  et  al.’s  findings  imply  that  basal 
uptake  differences  are  the  primary  source  of  individual  differences  in  oxygen  uptake  while 
running.  This  perspective  is  referred  to  as  the  parallel  lines  model  in  the  remainder  of  this  paper. 

Nevill  et  al.’s  (1992)  findings  supported  the  parallel  lines  model  over  the  fan  model.  This 
study  focused  on  2  factors  that  might  alter  the  choice  of  models.  First,  Nevill  et  al.’s  (1992) 
findings,  like  those  of  any  study,  could  be  the  product  of  chance.  Second,  their  statistical 
procedures  may  have  been  biased  in  favor  of  the  parallel  lines  model.  ANCOVA  procedures 
often  involve  a  sequential  decomposition  of  variance.  The  sequence  determines,  in  order,  the 

’individual  differences  in  bi  did  produce  a  statistically  significant  increase  in  the  explanatory  power  of  the 
model.  However,  statistical  significance  is  a  poor  guide  for  modeling  decisions  in  this  case.  Statistical 
significance  meant  that  the  variance  explained  by  differences  in  b,  was  greater  than  expected  by  chance. 
The  outcome  was  obtained  only  because  the  overall  model  accounted  for  almost  all  of  the  ml02  variance. 
The  small  residual  variance  translated  into  a  small  denominator  for  the  F-test  that  was  the  basis  for 
determining  statistical  significance.  Because  the  error  term  was  small  relative  to  the  overall  variance,  even 
trivial  explanatory  power  could  yield  a  large  enough  F-test  to  classify  the  explanatory  power  as  greater 
than  expected  by  chance.  The  fact  that  differences  in  b0  accounted  for  45  times  as  much  variance  as 
differences  in  bj  is  a  better  barometer  of  the  importance  of  bi  variation. 
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variance  explained  by  a  common  regression  line,  by  group  differences  in  the  intercept  of  the 
regression  line,  and  by  group  differences  in  the  slope  of  the  regression  line.2  This  sequence  will 
affect  inferences  in  the  present  case  if  differences  in  bi  correlate  with  differences  in  bo. 
Correlated  differences  imply  overlapping  explanatory  power.  The  default  sequence  assigns  any 
overlapping  explanatory  power  to  the  bo  differences.  From  a  modeling  perspective,  there  is  no  a 
priori  reason  why  the  variance  should  be  partitioned  in  this  manner.  Some  or  all  of  the 
overlapping  explanatory  power  could  be  attributed  to  bi.  Any  reallocation  would  obviously 
affect  conclusions  about  the  importance  of  bi  differences.  The  implication  is  that  Nevill  et  al.’s 
(1992)  statistical  model  may  have  been  biased  implicitly  against  retaining  the  RE  construct. 


Table  1.  Descriptions  of  Samples 


N 

Gender 

Age 

Weight 

(kg) 

VCEmax 

#  of 
Speeds 

Range  of 
Speeds 

A 

16 

6 

35 

63.1 

66.6 

3-4 

215-322 

Bi 

8a 

S 

18.4 

65.8 

59.7 

3-4 

249-360 

b2 

6 

? 

16.5 

51.5 

47.0 

4 

220-290 

Ci 

4 

<J 

32 

68.8 

69.3 

3 

214-268 

c2 

4 

? 

25 

54.0 

59.2 

3 

188-241 

Di 

6 

6 

10 

30.4 

NAb 

3 

133-183 

d2 

6 

S 

33 

77.4 

NAb 

3 

133-183 

E 

10 

_ & _ 

26 

NA- 

61.7 

3 

161-210 

Note.  Studies  were  A  =  Costill,  Thomason,  &  Roberts,  1973;  B  =  di  Prampero  et  al.,  1993;  C  = 
Morgan  et  al.,  1994;  D  =  Thorstensson,  1986;  E  =  Williams,  Krahenbuhl,  &  Morgan,  1991. 
Blank  spaces  indicate  that  the  infonnation  was  not  reported. 
aTwo  males  who  ran  at  only  two  velocities  were  dropped. 
bInfonnation  not  available. 


This  paper  examines  both  possible  explanations  for  the  Nevill  et  al.  (1992)  findings.  The 
examination,  which  relies  on  published  data  from  5  studies,  establishes  3  primary  points: 

•  Nevill  et  al.’s  findings  replicate. 

•  Individual  differences  in  bi  are  strongly  correlated  with  individual  differences  in  bo. 

•  Oxygen  uptake  can  be  predicted  about  equally  well  by  a  model  with  differences  in 
either  b0  or  bi. 

The  discussion  examines  the  reasons  for  preferring  the  fan  model  to  the  parallel  lines  model  in 
light  of  these  findings.  Because  the  fan  converges  at  a  non-zero  value  for  the  common  bO,  the 
discussion  also  considers  the  modification  of  standard  RE  measurement  methods. 


2Nevill  et  al.  (1992)  did  not  explicitly  state  that  their  analysis  employed  the  sequential  sums  of  squares 
(SS)  approach.  However,  the  total  SS  for  the  model  equals  the  sum  of  the  SS  for  the  pooled  regression, 
individual  differences  in  bo,  and  individual  differences  in  bi.  This  equivalence  is  unlikely  if  alternatives  to 
the  sequential  SS  procedure  (e.g.,  unique  SS)  were  used. 
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Methods 


Data  Sources 

The  PubMed  database  was  searched  with  “running  economy”  and  with  “oxygen  cost  AND 
running”  as  search  terms.  The  reference  lists  from  articles  identified  in  the  PubMed  search  were 
reviewed  to  identify  additional  candidate  studies.  Table  1  describes  the  5  studies  (Costill  et  ah, 
1973;  di  Prampero  et  ah,  1993;  Morgan  et  ah,  1994;  Thorstensson,  1986;  Williams  et  ah,  1991) 
that  reported  the  data  for  individuals  who  completed  submaximal  runs  at  3  or  more  speeds.  The 
60  participants  in  those  studies  formed  the  sample  for  the  present  analyses. 

Experimental  Procedures 

The  methods  used  in  the  studies  were  broadly  similar.  The  studies  all  had  participants 
perform  3  or  4  runs  at  submaximal  speeds.  Each  run  lasted  long  enough  to  ensure  that  oxygen 
uptake  reached  steady-state.  The  oxygen  consumption  rate  was  measured  for  2  min  at  the  end  of 
the  run  in  4  studies.  The  measurement  period  varied  from  28  sec  to  70  sec  in  the  fifth  study  (di 
Prampero  et  ah,  1993).  Table  1  provides  additional  descriptive  data  for  the  studies. 

Analysis  Procedures 

The  analyses  had  4  major  components.  The  first  component  was  replication.  Nevill  et  al.’s 
(1992)  ANCOVA  procedures  were  repeated  separately  for  each  of  the  8  samples  described  in 
Table  1.  The  general  linear  model  (GLM)  procedure  in  SPSS-PC  (SPSS,  Inc.,  1998a,  1998b) 
provided  this  element  of  the  analysis.  Individual  subjects  were  the  “group”  variable;  running 
speed  was  the  covariate.  The  ANCOVA  model  employed  the  sequential  decomposition  of 
variance  that  appeared  to  have  been  used  in  the  Nevill  et  al.  analyses. 

The  second  analysis  component  introduced  an  alternative  to  the  Nevill  et  al.  (1992)  model. 
The  alternative  model  was  defined  by  changing  the  order  of  entry  of  predictors  in  the  ANCOVA. 
The  model  specification  option  of  the  GLM  procedure  was  used  to  add  individual  differences  in 
bi  first  followed  by  individual  differences  in  bo. 

Statistical  significance  and  explanatory  power  were  both  examined  as  criteria  for 
evaluating  the  results  of  these  first  2  components  of  the  analyses.  Explanatory  power  was 
expressed  as  variance  explained  adjusted  for  the  number  of  parameters  fitted.  The  adjustment 
produces  the  co2  statistic  (Hays,  1963)  instead  of  s2.  In  an  ANOVA,  s2  is  the  variance  explained 
by  a  factor  divided  by  total  variance  (i.e.,  SSFactor/SSTotai).  This  index  of  explanatory  power 
includes  some  variance  that  can  be  attributed  to  chance.  Even  if  the  factor  being  examined  had 
no  effect  on  the  dependent  variable,  the  expected  amount  of  variance  explained  would  be  greater 
than  zero.  In  particular,  the  apparent  explanatory  power  would  equal  the  degrees  of  freedom  for 
the  factor  multiplied  by  the  mean  squared  error  (i.e.,  t//:|5MSrCs)-  The  co  statistic  estimates 
explanatory  power  adjusting  for  chance  differences.  In  particular,  co2  =  [SSFactor- 
(^f'MSresjj/SSxotai-  If  MSres  >  0,  then  co  <  s  .  However,  co  arguably  is  an  estimate  of  the 
systematic  variance  explained  by  the  factor.  The  distinction  between  variance  attributable  to 
chance  and  systematic  variance  was  important  in  the  present  analyses  because  of  the  large 
number  of  degrees  of  freedom  associated  with  individual  differences  in  the  slope  and  the 
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intercept.  The  raw  variance  explained  by  those  differences  could  be  substantial  even  if  these 
factors  essentially  explained  chance  amounts  of  variation. 

The  third  analysis  component  examined  the  degree  of  confounding  of  individual 
differences  in  bo  and  bi.  Linear  regression  equations  were  computed  for  each  individual  in  each 
sample.  The  bo  and  bi  parameter  estimates  were  recorded  for  each  person.  The  Pearson  product- 
moment  correlation,  r,  was  computed  to  quantify  the  covariation  of  these  parameters. 

The  fourth  analysis  component  was  a  set  of  hierarchical  linear  models  perfonned  with  the 
HLM5  computer  program  (Raudenbush,  Bryk,  Cheong,  &  Congdon,  2001).  The  predictors  in 
these  models  were  running  speeds  nested  within  subjects,  subjects  nested  within  studies,  and 
studies.  The  multilevel  models  tested  the  hypotheses  that  bo  and  bi  varied  significantly  between 
individuals  after  controlling  for  study  differences  (cf.,  Raudenbush  &  Bryk,  2002).  The  model 
was  computed  with  speed  centered  about  the  mean  running  speed  which  was  233.08  m-min'1. 
Centering  is  routinely  used  in  multilevel  modeling  to  reduce  confounding  between  predictors.  In 
this  case,  it  was  necessary  to  reduce  the  confounding  of  bo  and  bi  estimates.  The  effects  of 
centering  must  be  taken  into  account  when  interpreting  the  data. 

Results 

Nevill  et  al.’s  (1992)  findings  generally  replicated  when  their  analysis  methods  were 
followed.  The  relevant  results  are  shown  in  columns  A,  B,  and  Di  of  Table  2.  Column  A 
indicates  that  the  pooled  regression  line  accounted  for  -83%  of  the  variance  in  ml02  (range  = 
54%  -  92%).  Column  B  indicates  that  individual  differences  in  bo  accounted  for  an  additional 
-11%  of  the  variance  (range  =  3%  to  28%).  Individual  differences  in  fy  added  an  average  of 
-1%  of  the  variance  (range  =  0%  -  2%)  to  the  explanatory  power  of  the  model  (Column  Di). 

One  aspect  of  the  Nevill  et  al.  (1992)  findings  did  not  replicate.  The  variance  explained  by 
individual  differences  in  bi  was  statistically  significant  (p  <  .05)  in  only  1  of  8  samples 
(Williams  et  al.,  1991).  The  variance  explained  in  this  case  actually  was  less  than  average 
(0.4%).  The  overall  model  accounted  for  nearly  all  of  the  variance,  so  the  small  residual  made 
even  weak  effects  significant.  The  same  was  true  for  Nevill  et  al.  (1992),  so  bi  differences  had 
little  explanatory  value  even  when  they  were  statistically  significant. 
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Table  2.  Summary  of  ANCOVA  Results 

2 

f2“  for  Model:  Significance: 


A 

Ck 

D,e 

D/ 

D, 

D, 

Costill 

.916 

.033 

.034 

.013 

.012 

.069 

.086 

di  Prampero 

.779 

.053 

.060 

.020 

.013 

.223 

.289 

di  Prampero 

.546 

.283 

.297 

.024 

.010 

.200 

.323 

Morgan 

.808 

.176 

.179 

.001 

.000 

.414 

.754 

Morgan 

.784 

.206 

.208 

.004 

.001 

.146 

.308 

Thorstensson 

.870 

.113 

.111 

.000 

.000 

.630 

.454 

Thorstensson 

.873 

.066 

.061 

.007 

.011 

.340 

.271 

Williams 

.863 

.131 

.130 

.004 

.005 

.000 

.000 

Nevill 

.792 

.182 

.004 

.023 

Average 

.827 

.109 

.112 

.010 

.008 

Note.  Model  A  =  Common  regression  line;  Model  B  =  Individual  differences  in  intercepts; 
Model  C  =  Individual  differences  in  slopes;  Model  D  =  Individual  differences  in  slopes  and 
intercepts. 

“Relative  to  null  model. 
bRelative  to  Model  A. 
cRelative  to  Model  B. 

dThe  actual  value  (-0.15)  presumably  represented  the  effects  of  chance  sampling  variation  about 
a  true  value  of  0.00. 

eNevill  et  al.  (1992)  excluded  to  base  comparisons  on  the  same  set  of  observations. 


Table  2  also  describes  the  results  of  an  alternative  sequence  suitable  for  testing  the  fan 
model.  This  sequence  began  with  the  same  pooled  group  regression  (Column  A).  Adding 
differences  in  bi  at  the  second  step  accounted  for  -1 1%  of  the  variance  (Column  C).  Adding 
differences  in  bo  as  the  third  model  element  accounted  for  -0.8%  of  the  variance.  The  bo 
differences  had  statistically  significant  explanatory  power  in  only  1  of  the  8  samples  (Williams  et 
al.,  1991). 

Confounding  of  Individual  Differences 

The  results  summarized  in  Table  2  were  consistent  with  the  suggestion  that  individual 
differences  in  bi  correlate  with  individual  differences  in  bo.  The  combined  explanatory  power  of 
bo  and  bi  differences  was  -12%.  Almost  all  of  this  explanatory  power  (1 1%)  could  be  accounted 
for  by  either  model  component.  This  clear  and  substantial  overlap  in  explanatory  power  can  be 
explained  by  the  strong  association  between  bo  and  bi  ( r  =  -.935).  The  association  was  slightly 
stronger  in  the  HLM  analysis  with  uncentered  data  (r  =  -.961).  The  difference  between  these  two 
estimates  of  the  degree  of  overlap  probably  can  be  attributed  to  the  fact  that  the  HLM  analyses 
included  an  adjustment  for  the  effect  of  differences  between  studies. 
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HLM  Results 

A  fixed-effects  model  is  one  element  of  hierarchical  linear  models.  In  the  present  case,  this 
is  the  model  that  would  be  adopted  if  there  were  no  evidence  of  individual  or  study  differences  in 
the  regression  parameters.  This  element  of  the  model  was  the  equation: 

mlCV  =  45.30  +  ,208*(s  -  233.08) 

The  subtraction  of  233.08  m/min  (-14  km-hr'1)  from  speed  centered  the  analysis  on  the  grand 
mean.  The  intercept  in  the  centered  analysis  is  the  predicted  111IO2  at  the  mean  speed.  The 
intercept  for  the  raw  data  can  be  obtained  by  solving  the  equation  for  s  =  0.  The  raw  data 
intercept  was  3.18  mlkg" 1  -min"1 . 

The  random-effects  components  of  the  model  indicated  that  individual  and  study  were 
significant  sources  of  variation  in  the  regression  coefficients.  Individual  differences  in  bi  and  bo 
were  random-effects  components  of  the  model.  Differences  in  bi  were  significant  (y  =  81.61,  55 
df,p<  .012).  Differences  in  b0  only  approached  significance  (y2  =  69.27,  55  dfp  <  .094).  Study 
differences  were  significant  for  both  b0  (y2  =  81.33,  4  df,p<  .001)  and  bi  (y2  =  20.68,  4  df,p< 
.002). 


Discussion 

The  present  analyses  are  consistent  with  a  modified  fan  model  for  individual  differences  in 
mlCk  at  submaximal  running  speeds.  This  model  is  preferable  to  the  parallel  lines  model  implied 
by  Nevill  et  al.’s  (1992)  findings.  The  fan  model  would  be  modified  to  include  a  non-zero  value 
for  mlCk  when  s  =  0.  The  following  discussion  elaborates  on  these  basic  points. 

The  conclusion  that  a  modified  fan  model  is  appropriate  rests  on  several  considerations. 

The  fact  that  Nevill  et  al.’s  (1992)  findings  only  partially  replicated  is  one  factor.  With  regard  to 
that  replication,  the  present  analyses  showed  that  a  common  regression  model  that  applied  to  all 
participants  was  a  good  first  approximation  model  in  every  study.  The  replication  extended 
further  to  the  demonstration  that  adding  individual  differences  in  bo  produced  a  substantial 
average  improvement  in  model  accuracy.  The  replication  was  completed  by  evidence  that  adding 
individual  differences  in  bi  produced  only  a  trivial  improvement  in  predictive  accuracy.  While 
this  last  increment  of  variance  explained  was  statistically  significant  in  Nevill  et  al.’s  study,  it 
was  statistically  nonsignificant  in  7  of  8  samples  analyzed  here.  The  consistent  weakness  of  the 
bi  effects  argued  for  a  model  with  a  common  regression  slope,  but  different  intercepts.  In  other 
words,  the  evidence  supported  a  parallel  lines  model  when  the  analyses  replicated  the  Nevill  et 
al.’s  procedures. 

The  examination  of  the  alternative  statistical  model  gave  reason  to  consider  rejecting  the 
parallel  lines  model.  Analyses  guided  by  the  fan  alternative  showed  that  substituting  differences 
in  bi  for  differences  in  b0  produced  slightly  better  predictive  accuracy  (-0.3%)  at  the  second  step 
in  the  model.  Adding  differences  in  bo  at  the  third  step  produced  a  trivial  gain  in  predictive 
accuracy.  The  predictive  power  of  the  fan  and  parallel  lines  models  was  nearly  identical  because 
differences  in  bo  and  bi  were  very  strongly  related  (r  =  -.935). 
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The  fan  and  parallel  lines  models  were  the  only  viable  alternatives.  These  models  were 
roughly  equivalent  when  evaluated  by  the  combined  criteria  of  explanatory  power  and 
simplicity.  Other  alternative  models  were  less  satisfactory  by  these  criteria.  A  model  in  which 
both  bo  and  bi  varied  would  require  many  more  parameters  with  a  trivial  gain  in  explanatory 
power.  A  model  with  both  b0  and  bi  assigned  fixed  values  that  applied  to  all  individuals  would 
be  less  complex  (i.e.,  only  2  parameters),  but  it  would  have  about  1 1%  less  explanatory  power 
than  either  the  fan  or  parallel  lines  models. 

The  fan  model  is  preferable  to  the  parallel  lines  model.  The  HLM  analyses  provided  the  strongest 
empirical  basis  for  preferring  the  fan  model.  The  variance  of  bi  across  subjects  was  large  enough 
to  reject  the  null  hypothesis  at  the  traditional  p  <  .05  significance  level.  The  variance  in  bo 
approached  but  did  not  reach  this  significance  criterion.  The  slightly  greater  average  predictive 
accuracy  (-0.3%)  is  a  secondary  empirical  justification  for  this  preference. 

The  fan  model  also  is  preferable  to  the  parallel  lines  model  on  conceptual  grounds. 
Differences  in  bi  correspond  to  the  conceptual  definition  of  RE.  Differences  in  bo  lack  a  clear 
correspondence  to  any  existing  construct.  These  differences  might  be  equated  with  basal 
metabolism  or  resting  energy  expenditure,  but  there  is  good  reason  to  dismiss  these  possibilities. 
The  analyses  indicated  that  60%  of  the  b0  estimates  were  negative;  40%  were  <  -5  ml-kg-min. 
These  negative  bo  estimates  do  not  have  a  plausible  physiological  interpretation.  Other  things 
equal,  a  model  that  includes  a  parameter,  bi,  that  corresponds  to  a  physiologically  meaningful 
construct,  RE,  is  preferable  to  one  that  requires  some  convoluted  explanation  for  cases  in  which 
b0  <  0. 


Adopting  the  fan  model  retains  the  RE  construct  as  a  legitimate  concept,  but  the  current 
form  of  the  model  also  implies  that  standard  methods  of  measuring  RE  should  change.  RE 
measures  should  adjust  for  b0  by  subtracting  3.18  ml-kg'1 -min'1  from  the  measured  mlCT.  In  a 
review  of  studies  that  measured  RE  in  201  samples,  8%  of  the  studies  used  a  fixed-value 
adjustment,  usually  5  ml-kg'1 -min'1  based  on  Medbo,  Mohn,  Tabata,  Bahr,  Vaage,  and  Sejersted 
(1988).  The  present  findings  therefore  support  a  shift  in  emphasis  rather  than  an  entirely  novel 
approach  to  RE. 

RE  measures  based  on  net  oxygen  uptake  (i.e.,  the  adjusted  fan)  raise  a  number  of  research 
questions. 

•  Are  individual  differences  in  b0  really  the  product  of  chance?  Repeated  measures 
designs  such  as  those  used  by  Morgan  et  al.  (1994)  and  Williams  et  al.  (1991)  could 
answer  this  question.  The  data  from  these  designs  could  be  used  to  compute  bo  and  bi 
on  multiple  occasions.  Stable  individual  differences  in  b0  would  argue  that  these 
differences  are  not  the  results  of  chance. 

•  If  bo  is  a  constant,  what  is  the  correct  value?  Values  of  3.18  ml-kg  -min'1  and  5.0 
ml-kg'1 -min'1  were  mentioned  previously;  published  regression  equations  relating 
mlCk  to  running  speed  (Leger  &  Mercier,  1984;  Nevill  et  al.,  1992)  provide  a  range  of 
other  options.  Research  is  needed  to  better  define  the  required  adjustment. 

•  Does  adjustment  increase  validity?  Studies  that  compared  adjusted  and  unadjusted  RE 
measures  as  correlates  of  running  performance  or  other  relevant  variables  could 
answer  this  question.  No  such  studies  were  identified  in  a  recent  extensive  review  of 
the  relevant  literature. 
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Study  design  effects  on  RE  also  merit  further  study.  The  HLM  analyses  showed  large 
differences  between  studies.  Each  study  is  a  complex  configuration  of  factors  that  could 
contribute  to  these  differences.  A  sample  of  5  studies  is  too  small  to  explore  this  variation. 
Further  study  would  be  worthwhile  to  determine  what  factors  affect  the  RE  estimates. 

This  replication  and  extension  of  Nevill  et  al.’s  (1992)  inquiry  leads  to  the  conclusion  that 
RE  should  be  estimated  from  the  net  oxygen  cost  of  running.  The  net  cost  approach  subtracts  a 
basal  value  for  oxygen  uptake  from  the  measured  value.  The  net  value  arguably  reflects  the 
actual  cost  of  the  activity  involved  in  running  (McArdle  et  ah,  2001).  The  adjustment  will  yield 
unbiased  RE  estimates.  Studies  comparing  the  validity  of  adjusted  and  unadjusted  RE  estimates 
would  be  a  very  useful  extension  of  this  investigation. 
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